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Abstract 

Background: Lag phase is a period of time with no growth that occurs when stationary phase bacteria are 
transferred to a fresh medium. Bacteria in lag phase seem inert: their biomass does not increase. The low number 
of cells and low metabolic activity make it difficult to study this phase. As a consequence, it has not been studied 
as thoroughly as other bacterial growth phases. However, lag phase has important implications for bacterial 
infections and food safety. We asked which, if any, genes are expressed in the lag phase of Escherichia coli, and 
what is their dynamic expression pattern. 

Results: We developed an assay based on imaging flow cytometry of fluorescent reporter cells that overcomes the 
challenges inherent in studying lag phase. We distinguish between lag 1 phase- in which there is no biomass 
growth, and Iag2 phase- in which there is biomass growth but no cell division. We find that in lag 1 phase, most 
promoters are not active, except for the enzymes that utilize the specific carbon source in the medium. These genes 
show promoter activities that increase exponentially with time, despite the fact that the cells do not measurably 
increase in size. An oxidative stress promoter, kotG, is also active. When cells enter Iag2 and begin to grow in size, they 
switch to a full growth program of promoter activity including ribosomal and metabolic genes. 

Conclusions: The observed exponential increase in enzymes for the specific carbon source followed by an abrupt 
switch to production of general growth genes is a solution of an optimal control model, known as bang-bang control. 
The present approach contributes to the understanding of lag phase, the least studied of bacterial growth phases. 

Keywords: E. coli, Lag phase, Resource allocation, Optimal control theory, Bang-bang control, Pontryagin maximum 
principle, Transcriptional program, Stringent response 



Background 

When bacteria are inoculated into fresh medium, they 
often show a period without growth known as the lag 
phase [1-3]. Lag phase is interesting as a fundamental 
biological process in which bacterial physiology adapts 
to a new environment. Lag phase is also of interest in 
fields such as food safety- in which lag phase is one factor 
in determining food shelf life [4-6]. In medicine, lag phase 
plays a role when bacteria move into the blood stream or 
other locations where rapid growth can occur. In both 
food safety and medicine, the longer the lag phase the 
longer body defenses have time to fight pathogens, and 
the longer the natural gut flora have to out-compete the 
pathogens. In some cases, longer lag phase is associated 
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with increased invasiveness of bacterial cells [7,8], or with 
antibiotic tolerance [9]. 

There are several definitions of lag phase [2-4,10-18] -See 
Additional file 1: Table SI for details. For clarity, we divide 
lag phase into two sub-phases: lagl is the period between 
inoculation and until biomass begins to grow, and lag2 is 
the period between the end of lagl and the time that cell 
numbers begin to increase (Figure 1A, B and Additional 
file 1: Figure SI). Lagl has also been called the adaptation 
phase [13], a term we do not use because of the multiple 
meanings of adaptation in sensing and evolution. Note 
that lagl is not adaptation by mutation, but rather 
acclimatization of the cell physiology to new conditions. 
The name 'lag2' is contradictory in the sense that it is a 
phase of growth, namely the first cell generation. We 
however chose this name to describe the second part of 
the lag phase because cells do not grow in number, which 
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Cells were analysed using standard 
and imaging flow cytometry. 

Figure 1 Definition of lagl and Iag2 phases, and the experimental system. (A) A schematic representation of mean cell biomass over time, 
with separation into phases. In lag 1 , biomass does not measurably increase. The end of lagl is operationally defined using the biomass (or cell 
length) curve, at the intersection between the constant biomass line at the beginning of the curve, with the extrapolated exponential increase in 
biomass in Iag2 (black circle). In Iag2, biomass increases. The end of Iag2 is defined by the first division events. At this point biomass plateaus 
(or peaks and then declines), as cells reach a biomass characteristic of exponential growth. See Additional file 1: Table SI, and Figure S1. 
(B) A schematic representation of mean cell number over time, with separation into phases. The end of Iag2 is operationally defined by the 
intersection between the constant cell number line at the beginning of the curve, with the extrapolated exponential increase in cell number 
(green rectangle). The end of exponential phase is operationally defined by the intersection between the increasing cell number line, with the 
constant cell number line at the end of the curve (white triangle). See Additional file 1: Table SI, and Figure S1. (C) Schematic description of the 
experiment: cells were grown for 48 hours in rich defined medium, inoculated into poor or rich defined medium, grown for various amounts of 
time in a robotic system, and analyzed by standard flow cytometry and by imaging flow cytometry. 



is why previous works considered this phase to be part of 
lag phase. 

Operationally, the length of lagl phase is determined 
in a logarithmic plot of biomass or a related quantity 
(cell length, optical density, dry weight) by extrapolating the 
exponential growth of biomass and finding its intersection 
with the initial biomass (Figure 1A). The end of lag2 can be 
similarly defined from a logarithmic graph of cell numbers 
(Figure IB). 

When cells are grown overnight in a rich medium and 
then transferred to a poor medium, both lagl and lag2 
phases may occur. Mathematical models of lag phase 
include Baranyis model [19] in which biomass production 
relies on accumulation of bottleneck proteins in lag phase 
[18,19], and models which focus on cell-cell variation in 
lag phase duration [4,20] . Each individual cell has lagl and 



lag2 phases, but the duration of these phases varies 
stochastically between individual cells, as is evident in 
the measurements of Metris et al. [21] (Additional file 1: 
Figure SI). One possible origin of these cell-cell variations 
is stochastic variation in the levels of enzymes present in 
the cell [22-25]. 

Individual cell measurements also show that the cell 
cycle accelerates during the first few divisions [20,21,26], 
and thus it takes several generations to reach the maximal 
exponential growth rate. Thus, lag2 as defined here is only 
the first generation in a prolonged acceleration phase [3] 
(Additional file 1: Figure SI). 

Studies of gene expression in bacterial lag phase are 
relatively few [27-29]. Several studies have focused on par- 
ticular genes, such as the transcription regulator fis [30] 
and ribosomal genes [31-33]. A recent study explored the 
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genome-wide expression patterns of Salmonella Typhimur- 
ium upon transition from stationary phase in rich medium 
(48 hours growth in LB) to fresh LB medium [34]. In this 
case, bacteria seem to make biomass almost immediately 
(lag2), making it difficult to resolve lagl phase. The expres- 
sion program in lag2 was similar to expression in exponen- 
tial growth, with some differences in genes related to metal 
ion accumulation, phosphate accumulation and oxidative 
stress. Thus, current knowledge is greatest for lag2, the 
phase in which biomass grows, suggesting that it is similar 
in many ways to exponential phase. 

In contrast to lag2, there is little knowledge about lagl 
phase, the phase in which biomass does not grow. This 
lack of knowledge is due in part to the technical difficulty 
of studying lagl, and in part to the fact that commonly 
used protocols go from rich medium to rich medium so 
that lagl is very short. It is therefore not clear whether 
the expression program in lagl is fundamentally different 
from the expression in lag2 and exponential growth. For 
example, one may imagine several possibilities: either no 
expression in lagl of any gene, expression that is directed 
only towards selected genes, or expression of the same 
genes as in exponential phase but at low intensity. 

Here, we address the nature of lag phase by developing 
an assay for measuring cell size, cell number and promoter 
activity at high accuracy in individual bacterial cells in 
lagl and lag2 phases. This assay allows us to overcome the 
problem of low signal to background that occurs in 
culture-based assays of lag phase that rely on fluorimeters 
for optical density (OD) and fluorescence measurements. 
We found that the expression program in lagl phase is 
very different from that in lag2 and exponential phases. 
Most promoters are not measurably active in lagl phase. 
The promoters of the utilization operons for the specific 
carbon source in the medium are, however, highly active 
in lagl phase showing an exponential increase with time, 
despite the fact that biomass does not measurably increase. 
In lag2 and exponential phase, utilization genes, ribosomal 
genes and a wide range of growth genes are expressed 
together. This suggests a 'bang-bang' control strategy, 
known from optimal control theory in engineering: first 
concentrate resources on generating utilization genes for 
the specific carbon source, so that carbon can flow into 
energy production and building blocks (such as amino 
acids), and only then make ribosomes to produce biomass. 
The seemingly inert lagl phase thus shows selective and 
strong expression of specific genes. 

Results 

Flow cytometry assay on fluorescent reporter cells allows 
differentiation between bacterial lagl and Iag2 growth 
phases 

To measure promoter activity, we used strains from the 
E. coli reporter strain library previously developed in our 



lab [35-37]. Each strain bears a low-copy plasmid with 
fast folding GFP under the control of a full length copy 
of the promoter of interest. 

A technical challenge for studying lagl is low cell 
density: one cannot use the reporter strains to measure 
promoter activity during lagl phase in a culture using a 
multi-well fluorimeter, because cell density and fluores- 
cence is below the background detection limit [4,38]. One 
can use a high inoculum level (low dilution of stationary 
phase starters) to bypass the OD detection threshold, but 
this can affect lag phase duration and behavior due to the 
relatively high concentration of stationary phase stimulatory 
and inhibitory molecules [39-41], and due to quorum 
sensing [42-44]. Thus, we developed a flow cytometry 
assay to measure fluorescence and cell size from individual 
cells grown in batch culture (Figure 1C). 

To prepare the cells for flow cytometry at multiple 
time points after inoculation, we used a robotic liquid 
handling system to transfer cells from stationary phase 
culture to 96- deep well plates in a time-delayed manner. 
Wells were inoculated at varying temporal intervals, 
ranging from 5 to 180 minutes. At the end of this proced- 
ure, the multi-well plate had cells grown in fresh medium 
for different amounts of time post inoculation, ranging 
from zero to 15 hours. Cells were kept on ice and mea- 
sured in flow cytometers, so that samples represented 
a post-inoculation time course. 

We used flow cytometry to measure cell number and 
individual cells' fluorescence. By viewing individual cells, 
flow cytometry overcomes the signal to noise problems 
of low density batch cultures. We measured total cell 
density by the number of cytometry counts at a given 
time point. This measurement of cell density agreed very 
well with colony-forming-unit measurements on agar 
plates (Additional file 1: Figure S4). 

We also measured cells in an imaging flow cytometer, 
which provides images of individual cells. These images 
allowed a direct measurement of cell length. We used 
cell length to operationally define the end of lagl phase, 
at the intersection between the initial cell length line at 
the beginning of the curve, with the extrapolated expo- 
nential increase in cell length in lag2 (Figure 1A, black 
circle). The end of lag2 is operationally defined using the 
cell number curve: it is the intersection of the cells' ex- 
ponential growth line with the line of initial cell number 
(Figure IB, green rectangle). The end of exponential 
phase is similarly defined as the intersection between 
the exponential growth line and the line of final cell 
density (Figure IB, white triangle). 

We studied cells transferred from a stationary phase 
culture in defined M9 medium with glucose (0.2%) and 
casamino acids (M9CGLU), into fresh M9 medium with 
no casamino acids in which a given sugar (arabinose) is 
the sole carbon source at saturating (0.2%) concentrations 
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(M9ARA). As a control, we also studied cells inoculated 
into fresh M9CGLIL We found that for M9ARA, a sig- 
nificant lagl phase was seen lasting for 3 ± 1 hours 
(Figure 2A). After the lagl period, cells began to grow 
in length and entered lag2. The duration of lag2 was 
5.4 ± 1.2 hours. Thus lagl + lag2 lasted for 8.4 ± 1.2 hours. 
When cells entered exponential phase, the rise in cell 
numbers coincided with the plateauing of mean cell 
length (Figure 2B and Figure 2 A respectively). This is 
because cell divisions prevent a further rise in mean 
cell length. The exponential growth rate was 0.76 ± 
0.07 hour" 1 , corresponding to a generation time of 55 ± 
5 minutes. 

In contrast, transfer to the richer medium with glucose 
and amino acids (M9CGLU) showed immediate growth 
in cell size, and thus cells entered lag2 without a meas- 
urable delay (Figure 2C). Here, the rise in cell number 
is also coupled to the peak of mean cell size (Figure 2D 
and Figure 2C respectively). The duration of lag2 was 
2.2 ± 0.5 hours, the exponential growth rate is 1.4 ± 
0.4 hour" 1 , and the exponential generation time was 
29 ± 8 minutes. When cells entered stationary phase, 
their mean length decreased back to its pre-inoculation 
level (Figure 2C). 



In lagl phase, genes for arabinose utilization are expressed 
exponentially with time despite no growth in biomass 

We measured the fluorescence of the reporter cells for 
genes related to arabinose utilization in fresh M9ARA 
medium. We found that the fluorescence of the arabinose 
utilization genes rises exponentially with time during lagl 
and lag2 phases. The reporters for the catabolic operon 
araBAD, and the transporters araE and araFGH, show an 
exponential rise in fluorescence despite no measurable 
increase in biomass in lagl. araBAD expression rose by 
a factor of about 60 over the eight hours of lagl + lag2, 
(Figure 3A), corresponding to an exponential rate of 
0.45 ± 0.02 hour" 1 and a doubling time of 92 ± 4 minutes. 
Similarly, araE showed an exponential increase of about 
60 fold, at rate 0.44 ± 0.03 hour" 1 . This doubling time is 
nearly twice the cell doubling time of 55 ± 5 minutes in 
exponential phase in the same medium. As cells leave lag2 
phase and begin to divide, these reporters reach a steady- 
state of fluorescence levels, because cell divisions dilute 
the GFP produced by these promoters. 

One possible explanation for the exponential rise in 
reporter fluorescence during lagl is an auto-catalytic 
process in which cells devote their gene expression re- 
sources to utilizing the sugar; the more transporters 
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Figure 2 Cell length and cell number measurements allow growth curve segmentation into lagl and Iag2 phases. (A) Mean 
bacterial length in minimal arabinose medium (M9ARA). 1 urn = 3 pixels. The blue line is a guide to the eye. (B) Mean cell numbers (in 20 ul 
sample) in M9ARA medium. The red line is a fit to the optimal control model. (C) Mean bacterial length in minimal glucose + amino-acid 
medium (M9CGLU). 1 urn = 3 pixels. The blue line is a guide to the eye. (D) Mean cell number (in 20 ul sample) in M9CGLU medium. The red 
line is a fit to the model. Error bars indicate standard error. 
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Figure 3 On arabinose minimal medium, the arabinose promoters are selectively activated during lagl. Mean GFP fluorescence of the 

cells in minimal arabinose medium (M9ARA), for different reporter strains. Growth phases are indicated. Data for each reporter was normalized by 

its mean value. Lines are guides to the eye. Error bars indicate standard error. (A) Normalized mean fluorescence of the arabinose system 

reporters, and of katG. See Additional file 1: Figure S3. (B) Normalized mean fluorescence of ribosomal genes reporters. (C) Normalized mean 

fluorescence of amino-acid genes reporters. (D) Normalized mean fluorescence of the rhamnose, maltose and lactose system reporters, 
k J 



available, the more the sugar can be utilized giving rise 
to more expression and so on. 

Individual cell data from the flow cytometry experiment 
indicates that the expression of the arabinose system 
promoter increases with time for all cells, and is not an 
all-or-none phenomenon in which some cells express the 
operons and others do not (Additional file 1: Figure S2). 

In lagl phase, growth genes are not measurably expressed 

We also tested ribosomal promoter activity in M9ARA 
medium, using relevant reporter strains (rplN, rrnB, rrlA, 
rimj). We found that ribosomal reporters did not show an 
appreciable rise during lagl phase in arabinose (Figure 3B). 
When cells began to grow in size (lag2 phase), the ribo- 
somal promoters were activated (Figure 3B). This lag2 
increase appears mild in the case of rrnB due to its 
relatively high initial level The promoters began to ap- 
proach a steady-state activity levels in exponential phase, 
when their mean GFP production rates are balanced by 
the dilution rate due to cell division. 

We also tested amino-acid biosynthesis promoters for 
arginine, phenylalanine and serine {argA, argl, pheL and 
serA). We found that these promoters were not measurably 
active during lagl on arabinose (Figure 3C), even though 
the medium did not contain amino acids. These promoters 
became active only in lag2, rising at different rates, ap- 
proaching a steady-state in exponential phase. 



We tested catabolic genes for other sugars, rhamnose, 
maltose and lactose (rhaB, malK and lacZ). We found that 
these promoters were also not measurably active in lagl or 
thereafter (Figure 3D). Thus, cells seemed to selectively 
express the utilization genes for the nutrient in the medium 
(arabinose) in lagl, and not utilization genes for other sugars. 

To extend our survey of promoter activity in lagl phase, 
we performed a screen using 140 additional promoters, 
with diverse biological functions, in M9ARA medium. The 
promoters controlled genes involved in translation, tran- 
scription, global regulators, amino acid biosynthesis, oxida- 
tive stress, phosphate and metal transport and metabolism, 
cold shock, heat shock and global stress (see Additional 
files 1, 2 and 3). To allow a large screen, we used fewer 
time-points: three in lagl and one in lag2 phase. No growth 
related promoters (ribosomal, transcriptional, translational, 
amino-acid biosynthesis, etc.) showed a measurable pro- 
moter activity. Genes with a significant increase in lagl 
were re-tested at higher temporal resolution. We find that 
only one promoter reproducibly showed more than a 3- 
fold increase during lagl, the oxidative stress promoter 
katG (Figure 3A). However, it does not rise as rapidly as 
the arabinose promoters. We also found a minor increase 
in some of the iron metabolism and transporter promoters 
(see Additional files 2 and 3). 

When cells were inoculated into glucose and amino acid 
medium (M9CGLU medium), instead of arabinose, they 
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skipped lagl and entered lag2 immediately. We found that 
the arabinose utilization promoters were not active during 
lag2 or exponential phases, and slightly rose at stationary 
phase when glucose is depleted and cAMP rises (Figure 4A). 
Ribosomal promoters were active immediately after in- 
oculation, during lag2 and exp. phase (Figure 4B). The 
amino-acid promoters were mostly activated towards 
the end of the exp. phase, when the medium ran out of 
amino-acids (Figure 4C). 

Results are consistent with optimal control principle of 
'bang-bang 7 control 

Taken together, these results suggest that in lagl on arabin- 
ose, cells mostly focus their promoter activity on the ara- 
binose utilization genes. They do not activate the vast 
majority of promoters, including ribosomal genes, amino 
acid biosynthesis genes or utilization genes for other sugars. 

This is analogous to the control theory strategy known 
as bang-bang control. A bang-bang strategy posits that in 
order to produce a given amount of output in minimal 
time, invest first in producing the limiting component 
only, and then switch abruptly to making the other 
elements in the system. This is in contrast to a different 
strategy in which one makes all components at a given 
ratio throughout time. In lagl, cells devote their re- 
sources to making specific utilization proteins, and 
then, in lag2, switch to making ribosomes together 
with more utilization genes. To ask whether this strat- 
egy is optimal for making the most cells at a given time 
Tafter inoculation, we analyzed a mathematical model 
based on Baranyi's model for bacterial growth [19]. 
The model, outlined in Figure 5, provides equations for 
the production of a bottleneck enzymes P which provide 
the substrates for producing biomass M (more details in 
the Methods section). One seeks the best way to temporally 
allocate production resources between making P and mak- 
ing M, in order to double the biomass in minimal time. 

The model can be exactly solved, using an optimal 
control method known as Pontryagin's maximum 
principle [45,46]. The exact solution shows that the 
optimal allocation is to first make only P- as in lagl, 
and then switch to making both P and M- as in lag2 
(Figure 5A-C). This conclusion applies to a wide range 
of models and parameters and is not sensitive to the 
details of the equations used [46]. 

The exact solution (Figure 5 and Methods section) sug- 
gests that the lag phase duration depends on the initial level 
of bottleneck enzyme P 0 . The smaller P 0 , the longer the lag 
phase because it is advantageous to spend time to produce 
P 0 in order to increase the subsequent rate of biomass pro- 
duction (more precisely, lag duration depends on P 0 loga- 
rithmically: t ~ln(l/P 0 )). If P 0 exceeds a critical amount, 
P oc , it no longer becomes optimal to have a lagl phase - 
the optimal r equals zero (Figure 5D). Instead, the cells 
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Figure 4 Cells enter Iag2 immediately on glucose minimal medium, 
and express ribosomal promoters. Normalized mean GFP fluorescence 
of the cells over time, for different reporter strains in minimal glucose + 
amino-acid medium (M9CGLU). Growth phases are indicated. Data for 
each reporter were normalized by its mean value. Lines are guides to the 
eye, except for the katG reporter, in which the data was smoothed by a 
Gaussian low-pass filter. Error bars indicate standard errors. (A) Normalized 
mean fluorescence of the arabinose system reporters, and katG. 

(B) Normalized mean fluorescence of ribosomal reporters. 

(C) Normalized mean fluorescence of amino-acid genes reporters. 



immediately begin lag2 and produce biomass. This can ex- 
plain why for some sugars, such as glucose, lagl phase is 
not observed. If there are sufficient glucose utilization 
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(I) Optimal control model 
u(t) = fraction of resources devoted for 
biomass production (M). 
l-u(t) = fraction of resources devoted for 
production of bottleneck protein (P). 
The extended Baranyi's model [19]: 



f=<- 

dM 



u)vP 



M 



We optimize u(t) for the fastest possible 
cell doubling in the new medium 
(minimal time to doubling biomass), 
given an initial enzyme levels Po. 

(II) The optimal solution is 'bang- 
bang' control 
u(t) = 0 till time t = z, then u(t)=\. 

The optimal lagl phase duration is: 



f*P 0 
ln(2) 




P exponential increase rate, 



Figure 5 An optimal control model suggests a sudden shift from making bottleneck proteins to full biomass production. (A) The model 
seeks the optimal allocation function u(t) between making bottleneck protein P and making biomass M, in order to minimize the time to double 
the biomass. The function u(t) is the fraction of resources devoted to making M, and l-u(t) the fraction devoted to making P. The rate of 
production of P is parameterized by a maximal rate v. The maximal rate of producing M is the exponential growth rate jL/. A logistic term is 
omitted for simplicity, and including it does not change the qualitative result. The optimal solution for u(t), obtained in the Methods section, is a 
sharp switch from u = 0 to a value of u = 1, at time r. Lagl is the time up to time t, in which the cells make only P. In Iag2 and exponential 
phases, the period of time after t, the cells make both bottleneck protein P and biomass M. (B) The level of the bottleneck protein P. See 
Additional file 1: Figure S3. (C) The level of the biomass M. (D) Lagl phase duration r, as a function of initial bottleneck protein concentration P. 
(E) Lagl phase duration (r), as a function of P exponential increase rate v. 



enzymes in the initial cells, it is better to skip lagl and 
begin growth immediately. 

The model has three parameters, two of which can be 
estimated directly from the data: the exponential growth 
rate fA ~ 0.8 hr" 1 and the bottleneck protein exponential 
increase rate v - 0.5 hr" 1 . From these, and the observed 
duration of lagl phase, r ~ 3 hr, one can estimate the 



initial bottleneck enzyme level in the arabinose experi- 
ment: using the formula t = ^ In [ p^ 2 ) ) 1 m dicating a low 
initial enzyme concentration of P 0 - 0.1 (in units of its 
effective Michaelis-Menten constant). 

Finally, the model also suggests how different individual 
cells might vary in their lagl phase duration. It is known 
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that individual cells differ in the concentration of any 
given protein, an effect known as protein expression noise 
[22-25]. Cell-cell variations in the initial bottleneck en- 
zyme level P 0 will thus result in different lagl durations. 
Usually, the distribution of protein concentrations across 
cells is well described by a Gamma distribution [47], 
which resembles quite closely a log-normal distributions 
when protein copy number is not too low. Indeed, our 
flow cytometry measurements of the araB reporter strain 
show approximately log-normal distributions at early 
times (SI, Additional file 1: Figure S2). Because lag phase 
duration goes as the logarithm of P 0 , the model predicts 
that cell-cell variation in lag duration will roughly follow a 
normal distribution (because the lognormal distribution of 
P 0 mean a normal distribution of ln(P 0 )). This prediction 
agrees with observed distributions of lag times [26,48]. 

Discussion 

We found that the seemingly inert lagl phase shows 
exponentially increasing activity of a few specific promoters. 
To observe this, we developed an assay using E coli re- 
porter strains, which were imaged during flow cytometry, 
to measure promoter activity in lag phase. Using this ex- 
perimental system, we were able to follow cell size and 
promoter activity dynamics over the bacterial growth 
curve. This allowed clear distinction between the two lag 
sub-phases: lagl and lag2. 

With arabinose as the carbon source, we found that 
gene expression in lagl phase (the phase where biomass 
does not grow) is very different from gene expression in 
exponential phase and lag2 phase (the phase in which 
biomass grows but not cell number). It is also different 
from expression in stationary phase before inoculation. 
In lagl, the genes needed to utilize the specific carbon 
source in the medium (arabinose) are expressed with 
exponentially increasing dynamics, despite the fact that 
biomass does not measurably increase. The exponential 
rate of expression in lagl phase is nearly 2-fold slower 
than the exponential rate of cell growth in exponential 
phase. Utilization genes for other sugars are not expressed. 
General growth genes such as ribosomal genes are not 
expressed in lagl phase, and begin to be expressed only in 
lag2 when cell biomass increases. Amino acid biosynthesis 
genes are also not expressed in lagl phase on arabinose. 
In a screen of 140 diverse promoters, the only other 
promoter that rises sharply above its initial level is the 
oxidative stress promoter katG. The resulting picture 
resembles the engineering principle of bang-bang control 
[49-51], in which resources are devoted exclusively at first 
to the bottleneck proteins to provide carbon and energy 
for growth, and only later, at lag2, expression switches to a 
full growth program [52]. 

One may interpret the exponential rise in expression 
of utilization genes in lagl phase by considering a positive 



feedback process. After inoculation from stationary phase, 
the cell contains very few transporters and utilization genes 
for sugars such as arabinose. The influx of arabinose into 
the cell is low, and ribosomal production is prevented by 
the stringent response system [53-56]. The cells seem to 
utilize their meager initial resources [57] to primarily make 
new utilization genes: arabinose transporters and catabolic 
enzymes. In turn, due to these transporters and catabolic 
genes, arabinose influx increases, expression increases in 
turn, and an auto-catalytic process occurs. The exponential 
rate of this process is presumably determined by the rate 
constants of the transporters and enzymes, and the elong- 
ation rate of the translation machinery at low nutrient 
levels. When the cell has enough amino acids to unlock 
stringent response, ribosomal production ensues together 
with biomass growth and exit from lagl (this is in line with 
the finding of no apparent effect of a ArelA AspoT double 
deletion on lag2 in rich medium [34], suggesting that ppGpp 
control of ribosome production is released in lag2). 

A very short lagl phase is expected to be found in 
fresh media in which cells already express the relevant 
utilization genes at high levels in stationary phase. For 
these sugars, such as glucose in the present study, the cell 
comes prepared for almost immediate growth. It would be 
of interest to study the evolutionary circumstances that 
lead cells to choose in which sugar systems to invest 
during stationary phase. 

These findings relate also to theoretical models for lag 
phase dynamics, such as Baranyi's model [19], which are 
widely used in the food safety field to estimate lag phase 
duration in a given food product. Baranyi's model assumes 
that the cells in lagl produce a set of enzymes P at an 
exponential rate v, which provide building blocks for 
biomass production. The present findings may be inter- 
preted as identifying the enzymes P as the sugar utilization 
genes (am genes in the present arabinose medium). In 
many implementations of Baranyi's model to describe 
population growth curves, it is assumed for simplicity 
that the exponential rise in the enzymes P occurs at the 
same rate as the later exponential increase in biomass de- 
noted ft (that is, that v = ft) [19,58]. The present finding 
suggest that in the case of arabinose, P rises exponentially 
but at a different (slower) rate than the exponential rise in 
cell number (v < fi). This finding can lead to refinements 
of modeling for better predictions of lag phase duration. 

The abrupt switch from lagl with its specific expression 
program to lag2 with its general growth program may be 
interpreted in light of optimal control theory. Under a 
wide range of conditions, in order to achieve the greatest 
final product (cells), it is optimal to first make only the 
limiting resources in lagl (bottleneck proteins that utilize 
the carbon source), and then abruptly switch to making all 
of the biomass production machinery in lag2 (Figure 5). 
Mechanistically, this switch might be implemented by the 
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unlocking of stringent control A similar strategy is found 
in wasps that need to allocate resources between making 
workers and making reproductive individuals [49-51]. 
Wasps first make only workers, which bring in nutrients, 
and then, a few weeks before winter, switch to making 
reproductives (rather than making both workers and 
reproductives from the onset at a given ratio). Another 
example is found in development of the mouse gut, 
where stem cells make an abrupt transition between 
symmetric and asymmetric divisions, which has been 
suggested to be the fastest way to reach the desired 
number of differentiated cells [59]. 

Future work can extend the present approach to more 
genes, to fully map the expression strategy in lag phase. It 
would be important to develop further methods to measure 
protein quantities required to start the division cycle, in 
order to develop mechanistic models for bacterial growth 
phases. Testing the role of stringent response in lagl phase, 
and evolutionary experiments to change lag phase duration 
[60] can help identify genes and regulation important for 
this process. Another interesting topic is the role of cell-cell 
variation in lag phase: stochastic effects will result in differ- 
ent individual bacteria making the bang-bang switch at 
different times [26]. Such studies will require following 
individual cells and their promoter activity over time. 

Conclusions 

In this study we developed a method to explore E. coli gene 
expression in lag phase. We distinguish between lagl phase 
in which there is no visible growth, and lag2 in which cells 
grow but have not yet divided. When arabinose is the sole 
carbon source, the vast majority of promoters are inactive 
in lagl, except for the promoters for the carbon source in 
the medium (arabinose) which are activated with exponen- 
tially increasing dynamics. In lag2, E. coli switches abruptly 
to a full growth program of promoter activity. This behav- 
ior is consistent with a bang-bang control strategy of allo- 
cating resources to making bottleneck components first in 
order to optimize biomass production over time. 

Methods 

Strains and plasmids 

E. coli GFP reporter strains (K12 MG1655, with a pUA66 
based reporter plasmid, sclOl ori, kan R , with the gfpmut2 
gene [61]) are from the fluorescent reporter library 
described in [35]. Strain U66 with a promoterless re- 
porter plasmid was used for fluorescence background 
measurements [35,37,62]. All strains are available from 
Open Biosystems (Thermo Fisher Scientific Molecular 
Biology, USA). 

Growth conditions and samples preparation 

The variability in the history of bacterial populations can 
lead to variability in lag phase behavior even within the 



same strain [63-65]. We thus employed procedures to 
reduce lag phase variability between biological repeats and 
within each experiment. Media was filter sterilized using 
0.2-[im pore size filters (FP 30/0,2 CA-S, Whatman, UK; 
and Fast PES filter unit, Nalgene, Thermo Fisher Scientific, 
USA) to reduce false-positive events in the flow cytometry 
experiments, and to avoid autoclave steps that can intro- 
duce variations in pH and in other factors to the medium 
[34,66,67]. The same batches of medium constituents were 
used in all experiments to minimize experimental error. All 
liquid handling and incubation steps were carried out using 
a robotic liquid handler (Freedom Evo, Tecan, Switzerland) 
and incubator (StoreX STX44 ICBT, Liconic Instruments, 
Liechtenstein) system. Incubations with shaking (80 Hz = 
4800 RPM) were carried out at 37°C. Strains from frozen 
stocks were pre-grown for 48 hours in u-shaped bottom 
2 ml 96-deepwell plates (PlateOne, USA Scientific, USA), 
with 600 \A M9 minimal medium containing 0.2% glucose, 
0.05% casamino acids, and 50 (ig/ml kanamycin. This rela- 
tively long incubation time ensures that the cells are in 
stationary phase long enough to reduce the effects of cell 
history, and leads to a sizable lag phase. Longer incubation 
times increase the lag phase cell-to-cell variability pre- 
sumably due to increased stress [20,26,48,63,64,68]. 
We controlled for different amounts of time spent in 
stationary phase pre-inoculation, finding that this variable 
had little effect on the present conclusions. 

The deep well plates were covered with aeration- 
permitting lids (Universal Lid, Seahorse Bioscience, 
USA), to reduce contamination and evaporation. The 
experimental deep well plates were prepared with M9 
minimal medium (final volume 500 \A) containing 50 \ig/ 
ml kanamycin and 0.2% of a certain carbon source (glucose; 
L-arabinose). The glucose medium also contained 0.05% 
casamino acids (Bacto, BD, USA). 

During 15 hours, every 5-180 minutes, a different col- 
umn was inoculated with 8 reporter strains at a 1:750 
dilution from the stationary phase culture. After 15 hours, 
the deep well plates contained cells that were from 5 mi- 
nutes up to 15 hours in fresh medium. This conveniently 
generated a full time-course of growth in one plate, that 
could be analyzed in a single flow cytometry session. One 
column in each plate was not inoculated and was used for 
both medium contamination detection and blank sample 
measurements. Reporter strains from the stationary phase 
cultures were diluted 1:750 into pre-cooled M9, and 
represented the strains at t = 0 (end of stationary phase 
or beginning of lag phase). After the experimental incuba- 
tion, the cells were rapidly cooled to 4°C, diluted according 
to their density (from no dilution at all, up to 1:100 dilu- 
tion), to avoid coincidence in the flow cytometry instru- 
ments [69,70], and transferred into pre-cooled u-shaped 
96-wells plates (Costar, Corning, USA) for the automated 
flow cytometry analysis. All samples were measured at 
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most 4 hours after the growth experiments ended. The 
same samples were analyzed both using the LSRII and the 
ImageStream x instruments. 

No fixation or staining were used as these procedures 
potentially affect the side scatter (SSC), forward scatter 
(FSC), and GFP values of bacterial cells, as well as the 
bacterial count [70-72]. Moreover, the added steps neces- 
sary for these procedures might increase variability between 
the samples and between the experiments. 

Flow cytometry measurements 
Standard flow cytometry 

20 [A sample at a flow rate of 1 \A/s were analyzed per 
sample using a LSRII (BD, USA) flow cytometry instrument 
(settings: FSC 648 mV, SSC 346 mV, GFP 450 mV, thresh- 
old collection FSC > 200) at 4°C. The cells' FSC, SSC and 
GFP values were collected. The events rate for the bacterial 
samples was between 1,000-9,500 events/s, and did not 
come near the upper limit of the instrument (20,000 
events/s), to avoid coincidence [69,70]. A total of 20,000- 
190,000 events were collected per sample. Blank samples 
with medium only were used in each experiment to exclude 
contaminations, and to calculate background event num- 
bers, with an average event rate of 50 events/s (Additional 
file 1: Figure S5). FSC & SSC were used for gating. In 
contrast to most flow cytometry experiments that usu- 
ally use narrow gating to achieve measurements of a 
homogenized population, a wide gating was used in this 
study (Additional file 1: Figure S6). This large gating 
allowed measuring the entire population, and especially to 
monitor cell size and cell number. The number of back- 
ground events in the gated area was negligible, between 
0.3-3% of the total number of gated events per sample. The 
mean day-to-day error between fluorescence measurements 
for the arabinose system reporters was 9%, and below 17% 
for cell count. 

Bacterial growth curves were calculated by the following 
procedure: In each well, the same liquid volume and liquid 
flow rate were used to count the number of events (cells) [73]. 
Combined with the known dilution coefficients (from the ori- 
ginal experimental deep well plates), we calculated bacterial 
cell counts at each time point for each reporter strain. 

Imaging flow cytometry 

Around 4000 events were collected per sample using the 
ImageStream x imaging flow cytometry instrument and 
Inspire software (Amnis, Seatle, USA). The cells' bright 
field (BF), SSC & GFP images were taken at an event rate 
of 5-500 cells/s. The instrument uses beads to track the cell 
flow and to focus on the cells. Bead images were ignored 
during image acquisition or removed from the analysis to 
increase image analysis accuracy. Instrument settings for 
image collection: raw_max_pixel_SSC < 200 (ignored most 
of the bead images), raw_max_pixel_GFP < 4094 (ignored 



saturated GFP images), raw_min_area_BF > 1 square |im 
(>9 pixels, this removed small objects suspected as dust 
and cell fragments). Image analysis was performed using 
ImageStream x analysis software IDEAS 4.0 (Amnis, Seatle, 
USA). Gating: more beads were removed by gating with 
mean_pixel_BF and mean_pixel_SSC, unfocused cells were 
removed by gating using gradient_RMS_BF, and more 
beads were removed by gating with aspect_ratio_intensi- 
ty_BF min_pixel_BF [74]. Flow aligns the cell long axis 
with the flow direction, so that length measurements in 2D 
images are an accurate indication of true length [75,76]. 
Measurements of mean cell length had a mean day to day 
error of 3%. 

Standard flow cytometry and imaging flow cytometry 
analysis were conducted on the same samples. All experi- 
ments were repeated at least twice. 

Optimal control model 

We model the lag phase enzyme dynamics using the 
framework of Baranyi and Roberts [19]. The cells make 
biomass M at a rate that depends on a bottleneck enzyme 
(representing for example transporters and catabolic 
enzymes for arabinose utilization) whose concentration is 
P. The rate of biomass production is dependent on P in a 
Michaelis-Menten form: 



dM 
dt 



M 



(1) 



Where ft is the maximal growth rate, and P is given in 
units of its effective halfway concentration, so that at P = 1 
biomass production is half-maximal. The function u(t) is 
the proportion of resources that the cell devotes to biomass 
production, which can vary between one and zero. 

The bottleneck enzyme P is produced at rate vP, times 
the fraction of resources the cell devotes to it, namely 1-u: 



dP 
dt 



(l-u)vP 



(2) 



The objective in the optimization problem we study 
is to divide rapidly in the new condition. We thus seek 
the control profile u{t) that will double the biomass in 
the minimal amount of time, T, Initial conditions are 
P(t = 0) = P 0 , the amount of enzyme present in the inocu- 
lated cells, and initial biomass M(t = 0) = M 0 . 

According to Pontryagins maximum principle [45,46], 
such minimal time problems are solved by the profile u{t) 
that maximizes at every time point the Hamiltonian: 



i dM dP 

H = ~ l ^lti^Tt 



(3) 



-l+XiM(i 



iX+P) 



M + \ 2 (l-u)vP 
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The Lagrange multipliers Xj(t) and \ 2 {t) follow Hamilton's 
equations: 



d\ x _ m d\ 2 _ dH 
~di ~ ~dM'~dt ~ ~ BP 



(4) 



Because the Hamiltonian H is linear in u (Eq 3), it can 
be maximal only at the extreme values that u can take, 
namely zero and one. This means bang-bang control: 
devote all the cell resources to either M or P, with sharp 
switches between these states allowed. Bang-Bang control 
is expected with any Hamiltonian function that is linear or 
convex in u, a very wide class of possible models [46]. 
Analyzing the equations for X 2 and X 2 shows at most a 
single switch between the extreme values of u can occur 
in the present model. 

Assuming a single switch between making only P and 
then switching to make only M, with switching at time r, 
one can solve Eq 2 with u = 0 to find 



P(t) = P 0 e vx 



(5) 



From time r to the division time T, one has u = 1, so P 
is constant = P(t) and M increases exponentially: 



M(T) =M 0 e l 



„ EW) (T-t) 

^(1+P(t))V J V 



(6) 



We seek the time T where biomass doubles, M(T) = 2 
M 0 . Solving for T results in 



T 



v- V 



i 



P 0 e v 



(7) 



Finding the lag duration r that minimizes the time to 
first division T, namely dTVdr = 0, results in the following 
solution: 



where 



v Vv, 



fiP 0 



In(2) 



(8) 



(9) 



Thus, t is positive only when v > v c . It is equal to zero 
for v < v a meaning no lagl phase and a direct transition to 
lag2 (Figure 5E). The lag phase duration r is maximal 
when v = e • v c , where e = 2.71. Another way to consider 
this result is that for a given v and there is a critical ini- 
tial enzyme level P 0 above which there is no lagl: P oc = In 
(2)v/pc. For the present arabinose parameters, P 0C ~0A. 
The duration of lag2 in this model is log(2)/^ + II v ~ 3 h, 
which is shorter than the observed arabinose lag2 phase 
duration, ~5 h. This discrepancy might be explained by 
using a stochastic model in which lag phase differs 
between cells. Note that the experimental data (Figure 3) 
shows increases of total P in lag2 (fluorescence of the ara 



promoters), whereas the model shows that P concentra- 
tion becomes constant at lag2. These two observations are 
consistent, because biomass grows in lag2, and therefore 
constant P concentration means an increase in total P 
protein as observed in the experiment. 

Additional files 



Additional file 1: Supplemental information. Supporting figures and 
supporting tables. 

Additional file 2: Screen data. 140 promoters normalized screen data. 

Additional file 3: Screen plots. 140 promoters normalized screen data 
plots. 
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